A New English/Arabic Parallel Corpus for Phishing Emails

نویسندگان

چکیده

Phishing involves malicious activity whereby phishers, in the disguise of legitimate entities, obtain illegitimate access to victims’ personal and private information, usually through emails. Currently, phishing attacks threats are being handled effectively use latest email detection solutions. Most current systems assume be English, though other languages growing. In particular, Arabic is a widely used language therefore represents vulnerable target. However, there significant shortage corpora that can develop systems. This paper presents development new English-Arabic parallel corpus has been developed from anti-phishing share task text (IWSPA-AP 2018). The content was translated, had allotted 10 volunteers who university background were English experts. To evaluate effectiveness corpus, we models using Term Frequency–Inverse Document Frequency (TF-IDF) Multilayer Perceptron 1258 emails have equal ratios experimental findings show accuracy reaches 96.82% for dataset 94.63% providing some assurance potential value developed.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Managing Phishing Emails: A Scenario-Based Experiment

In this paper, the authors report on a collaborative research project that investigates how people respond to phishing emails compared to genuine emails and what factors contribute to this behaviour. A scenario-based, role-play experiment was conducted by administering a web-based questionnaire via a series of seminars facilitated by a member of the research team. This questionnaire asked each ...

متن کامل

MDMap: Assisting Users in Identifying Phishing Emails

Email-based online phishing is one of the key security threats that greatly deteriorate the trustworthiness of the Internet. Although many spam filters have been developed and deployed, a non-negligible number of phishing emails still sneak into users’ inboxes each day. Phishing emails often contain suspicious information that separate them from the legitimate ones; however, average non-expert ...

متن کامل

Detecting Phishing Emails the Natural Language Way

Phishing causes billions of dollars in damage every year and poses a serious threat to the Internet economy. Email is still the most commonly used medium to launch phishing attacks [1]. In this paper, we present a comprehensive natural language based scheme to detect phishing emails using features that are invariant and fundamentally characterize phishing. Our scheme utilizes all the informatio...

متن کامل

Detection Phishing Emails Using Features Decisive Values

Phishing emails are messages designed to fool the recipient into handing over personal information, such as login names, passwords, credit card numbers, account credentials, social security numbers etc. Fraudulent emails harm their victims through loss of funds and identity theft. They also hurt Internet business, because people lose their trust in Internet transactions for fear that they will ...

متن کامل

a new type-ii fuzzy logic based controller for non-linear dynamical systems with application to 3-psp parallel robot

abstract type-ii fuzzy logic has shown its superiority over traditional fuzzy logic when dealing with uncertainty. type-ii fuzzy logic controllers are however newer and more promising approaches that have been recently applied to various fields due to their significant contribution especially when the noise (as an important instance of uncertainty) emerges. during the design of type- i fuz...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing

سال: 2023

ISSN: ['2375-4699', '2375-4702']

DOI: https://doi.org/10.1145/3606031